Overview

Dataset statistics

Number of variables: 22
Number of observations: 55381
Missing cells: 206603
Missing cells (%): 17.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.3 MiB
Average record size in memory: 176.0 B
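
The percentage of missing cells and the average record size follow from the other figures in this block. The sketch below redoes that arithmetic, assuming the percentage is taken over variables × observations and that 1 MiB = 1024² bytes. Because the 9.3 MiB figure is already rounded, the record size only matches the reported 176.0 B approximately.

```python
# Figures copied from the "Dataset statistics" block above.
n_variables = 22
n_observations = 55381
missing_cells = 206603
total_size_mib = 9.3  # already rounded in the report

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 ** 2 bytes.
avg_record_bytes = total_size_mib * 1024 ** 2 / n_observations

print(f"total cells:         {total_cells}")
print(f"missing cells (%):   {missing_pct:.1f}")
print(f"avg record size (B): {avg_record_bytes:.1f}")
```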

Variable types

Categorical: 10
Numeric: 8
Boolean: 1
Unsupported: 3

Alerts

socialEngagementType has constant value "Not Socially Engaged" Constant
totals.newVisits has constant value "1.0" Constant
trafficSource.isTrueDirect has constant value "True" Constant
totals.bounces has constant value "1.0" Constant
geoNetwork.country has a high cardinality: 173 distinct values High cardinality
trafficSource.source has a high cardinality: 108 distinct values High cardinality
totals.hits is highly correlated with totals.pageviews and 1 other field High correlation
totals.pageviews is highly correlated with totals.hits and 1 other field High correlation
totals.timeOnSite is highly correlated with totals.hits and 1 other field High correlation
socialEngagementType is highly correlated with device.deviceCategory and 7 other fields High correlation
device.deviceCategory is highly correlated with socialEngagementType and 4 other fields High correlation
totals.newVisits is highly correlated with socialEngagementType and 7 other fields High correlation
trafficSource.isTrueDirect is highly correlated with socialEngagementType and 7 other fields High correlation
trafficSource.medium is highly correlated with socialEngagementType and 4 other fields High correlation
device.browser is highly correlated with socialEngagementType and 4 other fields High correlation
totals.bounces is highly correlated with socialEngagementType and 7 other fields High correlation
channelGrouping is highly correlated with socialEngagementType and 4 other fields High correlation
device.operatingSystem is highly correlated with socialEngagementType and 5 other fields High correlation
channelGrouping is highly correlated with trafficSource.medium High correlation
totals.hits is highly correlated with totals.pageviews and 1 other field High correlation
totals.pageviews is highly correlated with totals.hits and 3 other fields High correlation
totals.timeOnSite is highly correlated with totals.hits and 2 other fields High correlation
totals.transactions is highly correlated with totals.pageviews and 1 other field High correlation
totals.totalTransactionRevenue is highly correlated with totals.pageviews and 2 other fields High correlation
trafficSource.medium is highly correlated with channelGrouping High correlation
device.browser is highly correlated with device.operatingSystem High correlation
device.operatingSystem is highly correlated with device.browser and 1 other field High correlation
device.deviceCategory is highly correlated with device.operatingSystem High correlation
totals.timeOnSite has 24783 (44.8%) missing values Missing
totals.transactions has 49516 (89.4%) missing values Missing
totals.newVisits has 13705 (24.7%) missing values Missing
totals.totalTransactionRevenue has 49520 (89.4%) missing values Missing
trafficSource.isTrueDirect has 38404 (69.3%) missing values Missing
totals.bounces has 30667 (55.4%) missing values Missing
hits.type is an unsupported type, check if it needs cleaning or further analysis Unsupported
hits.hour is an unsupported type, check if it needs cleaning or further analysis Unsupported
hits.minute is an unsupported type, check if it needs cleaning or further analysis Unsupported
hits.eCommerceAction.action_type has 55010 (99.3%) zeros Zeros
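
Several alerts above flag columns that hold a single value wherever they are populated (for example totals.bounces, which is "1.0" or missing). A minimal sketch of how such columns could be detected is shown below. It uses plain dictionaries and toy rows rather than the real dataset; the helper name `constant_columns` and the use of `None` for a missing cell are illustrative assumptions, not part of the report.

```python
from collections import defaultdict

def constant_columns(rows):
    """Return columns whose non-missing values are all identical."""
    seen = defaultdict(set)
    for row in rows:
        for column, value in row.items():
            if value is not None:  # None stands in for a missing cell
                seen[column].add(value)
    return sorted(col for col, values in seen.items() if len(values) == 1)

# Toy rows (not taken from the dataset) that mimic the alerted pattern.
rows = [
    {"socialEngagementType": "Not Socially Engaged", "totals.bounces": 1.0, "totals.hits": 1},
    {"socialEngagementType": "Not Socially Engaged", "totals.bounces": None, "totals.hits": 7},
    {"socialEngagementType": "Not Socially Engaged", "totals.bounces": 1.0, "totals.hits": 2},
]

print(constant_columns(rows))
```

Columns reported this way carry no information on their own; whether their missingness is informative is a separate question that the report does not answer.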

Reproduction

Analysis started: 2022-05-13 02:15:04.986521
Analysis finished: 2022-05-13 02:15:35.505816
Duration: 30.52 seconds
Software version: pandas-profiling v3.2.0
Download configuration: config.json
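
The reported duration can be cross-checked against the two timestamps. This is a small sketch using only the standard library; the format string is an assumption about how the timestamps are laid out.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the timestamps above
started = datetime.strptime("2022-05-13 02:15:04.986521", FMT)
finished = datetime.strptime("2022-05-13 02:15:35.505816", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```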

Variables

socialEngagementType
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 432.8 KiB
Not Socially Engaged: 55381

Length

Max length: 20
Median length: 20
Mean length: 20
Min length: 20

Characters and Unicode

Total characters: 1107620
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
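
As an illustration of the character properties mentioned above, the following sketch counts Unicode general categories for the column's only value with Python's `unicodedata` module. It is not the report's own implementation; it only shows where a split into lowercase letters, uppercase letters and space separators can come from.

```python
import unicodedata
from collections import Counter

value = "Not Socially Engaged"  # the column's only value

# unicodedata.category returns two-letter codes such as
# "Ll" (lowercase letter), "Lu" (uppercase letter), "Zs" (space separator).
categories = Counter(unicodedata.category(ch) for ch in value)

for code, n in categories.most_common():
    print(f"{code}: {n} ({n / len(value):.1%})")
```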

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Not Socially Engaged
2nd row: Not Socially Engaged
3rd row: Not Socially Engaged
4th row: Not Socially Engaged
5th row: Not Socially Engaged

Common Values

Value                  Count   Frequency (%)
Not Socially Engaged   55381   100.0%

Length

Histogram of lengths of the category

Category Frequency Plot

Value      Count   Frequency (%)
not        55381   33.3%
socially   55381   33.3%
engaged    55381   33.3%

Most occurring characters

ValueCountFrequency (%)
o110762
 
10.0%
110762
 
10.0%
a110762
 
10.0%
l110762
 
10.0%
g110762
 
10.0%
N55381
 
5.0%
t55381
 
5.0%
S55381
 
5.0%
c55381
 
5.0%
i55381
 
5.0%
Other values (5)276905
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter830715
75.0%
Uppercase Letter166143
 
15.0%
Space Separator110762
 
10.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o110762
13.3%
a110762
13.3%
l110762
13.3%
g110762
13.3%
t55381
6.7%
c55381
6.7%
i55381
6.7%
y55381
6.7%
n55381
6.7%
e55381
6.7%
Uppercase Letter
ValueCountFrequency (%)
N55381
33.3%
S55381
33.3%
E55381
33.3%
Space Separator
ValueCountFrequency (%)
110762
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin996858
90.0%
Common110762
 
10.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o110762
11.1%
a110762
11.1%
l110762
11.1%
g110762
11.1%
N55381
 
5.6%
t55381
 
5.6%
S55381
 
5.6%
c55381
 
5.6%
i55381
 
5.6%
y55381
 
5.6%
Other values (4)221524
22.2%
Common
ValueCountFrequency (%)
110762
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1107620
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o110762
 
10.0%
110762
 
10.0%
a110762
 
10.0%
l110762
 
10.0%
g110762
 
10.0%
N55381
 
5.0%
t55381
 
5.0%
S55381
 
5.0%
c55381
 
5.0%
i55381
 
5.0%
Other values (5)276905
25.0%

channelGrouping
Categorical

HIGH CORRELATION

Distinct: 7
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 432.8 KiB
Organic Search: 19245
Social: 17636
Direct: 7918
Referral: 7833
Paid Search: 1562
Other values (2): 1187

Length

Max length: 14
Median length: 11
Mean length: 9.267709142
Min length: 6

Characters and Unicode

Total characters: 513255
Distinct characters: 23
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Organic Search
2nd row: Organic Search
3rd row: Organic Search
4th row: Organic Search
5th row: Organic Search

Common Values

Value            Count   Frequency (%)
Organic Search   19245   34.8%
Social           17636   31.8%
Direct            7918   14.3%
Referral          7833   14.1%
Paid Search       1562    2.8%
Affiliates         782    1.4%
Display            405    0.7%

Length

Histogram of lengths of the category

Category Frequency Plot

Value        Count   Frequency (%)
search       20807   27.3%
organic      19245   25.3%
social       17636   23.1%
direct        7918   10.4%
referral      7833   10.3%
paid          1562    2.1%
affiliates     782    1.0%
display        405    0.5%

Most occurring characters

ValueCountFrequency (%)
a68270
13.3%
c65606
12.8%
r63636
12.4%
i48330
9.4%
e45173
8.8%
S38443
 
7.5%
l26656
 
5.2%
20807
 
4.1%
h20807
 
4.1%
O19245
 
3.7%
Other values (13)96282
18.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter416260
81.1%
Uppercase Letter76188
 
14.8%
Space Separator20807
 
4.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a68270
16.4%
c65606
15.8%
r63636
15.3%
i48330
11.6%
e45173
10.9%
l26656
 
6.4%
h20807
 
5.0%
g19245
 
4.6%
n19245
 
4.6%
o17636
 
4.2%
Other values (6)21656
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
S38443
50.5%
O19245
25.3%
D8323
 
10.9%
R7833
 
10.3%
P1562
 
2.1%
A782
 
1.0%
Space Separator
ValueCountFrequency (%)
20807
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin492448
95.9%
Common20807
 
4.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a68270
13.9%
c65606
13.3%
r63636
12.9%
i48330
9.8%
e45173
9.2%
S38443
7.8%
l26656
 
5.4%
h20807
 
4.2%
O19245
 
3.9%
g19245
 
3.9%
Other values (12)77037
15.6%
Common
ValueCountFrequency (%)
20807
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII513255
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a68270
13.3%
c65606
12.8%
r63636
12.4%
i48330
9.4%
e45173
8.8%
S38443
 
7.5%
l26656
 
5.2%
20807
 
4.1%
h20807
 
4.1%
O19245
 
3.7%
Other values (13)96282
18.8%

date
Real number (ℝ≥0)

Distinct: 184
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20162192.01
Minimum: 20160801
Maximum: 20170131
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 20160801
5-th percentile: 20160811
Q1: 20160921
median: 20161102
Q3: 20161206
95-th percentile: 20170120
Maximum: 20170131
Range: 9330
Interquartile range (IQR): 285

Descriptive statistics

Standard deviation: 3040.550087
Coefficient of variation (CV): 0.0001508045398
Kurtosis: 2.9326919
Mean: 20162192.01
Median Absolute Deviation (MAD): 112
Skewness: 2.217409118
Sum: 1.116602356 × 10^12
Variance: 9244944.834
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
20161128               547    1.0%
20161115               525    0.9%
20161004               497    0.9%
20161201               479    0.9%
20161205               478    0.9%
20161129               470    0.8%
20161026               469    0.8%
20161116               466    0.8%
20161114               464    0.8%
20161130               457    0.8%
Other values (174)   50529   91.2%

Minimum 10 values

Value      Count   Frequency (%)
20160801     208   0.4%
20160802     245   0.4%
20160803     327   0.6%
20160804     342   0.6%
20160805     322   0.6%
20160806     185   0.3%
20160807     183   0.3%
20160808     290   0.5%
20160809     298   0.5%
20160810     308   0.6%

Maximum 10 values

Value      Count   Frequency (%)
20170131     237   0.4%
20170130     264   0.5%
20170129     194   0.4%
20170128     155   0.3%
20170127     224   0.4%
20170126     233   0.4%
20170125     352   0.6%
20170124     413   0.7%
20170123     236   0.4%
20170122     178   0.3%
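
The profiler treats date as a real number, so statistics such as Mean, Range and IQR are computed on YYYYMMDD integers and are not calendar quantities. The sketch below, which assumes the values are YYYYMMDD-encoded days (the report does not state this, but the minimum and maximum fit that pattern), parses the two extremes and contrasts the numeric range with the calendar span.

```python
from datetime import datetime

def parse_yyyymmdd(value):
    """Convert an integer such as 20160801 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

first = parse_yyyymmdd(20160801)  # reported Minimum
last = parse_yyyymmdd(20170131)   # reported Maximum

numeric_range = 20170131 - 20160801      # what the report lists as "Range"
calendar_span = (last - first).days + 1  # days covered, both ends included

print(numeric_range)
print(calendar_span)
```

Under that assumption the calendar span equals the reported number of distinct values (184), which would mean every day in the period appears at least once.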

fullVisitorId
Real number (ℝ≥0)

Distinct: 51510
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.517018685 × 10^18
Minimum: 3.579413597 × 10^13
Maximum: 9.999357271 × 10^18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 3.579413597 × 10^13
5-th percentile: 2.153310022 × 10^17
Q1: 1.649316738 × 10^18
median: 4.398696916 × 10^18
Q3: 7.19973052 × 10^18
95-th percentile: 9.450484153 × 10^18
Maximum: 9.999357271 × 10^18
Range: 9.999321477 × 10^18
Interquartile range (IQR): 5.550413782 × 10^18

Descriptive statistics

Standard deviation: 3.060419211 × 10^18
Coefficient of variation (CV): 0.6775307841
Kurtosis: -1.274019915
Mean: 4.517018685 × 10^18
Median Absolute Deviation (MAD): 2.772709824 × 10^18
Skewness: 0.1307247016
Sum: 2.501570118 × 10^23
Variance: 9.366165748 × 10^36
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                   Count   Frequency (%)
1.856749148 × 10^18        21   < 0.1%
3.608475193 × 10^18        20   < 0.1%
6.760732402 × 10^18        18   < 0.1%
8.248397261 × 10^17        16   < 0.1%
9.497189156 × 10^17        15   < 0.1%
1.957458976 × 10^18        15   < 0.1%
4.578640586 × 10^18        15   < 0.1%
1.063651652 × 10^18        13   < 0.1%
6.254908847 × 10^18        12   < 0.1%
1.956307608 × 10^18        11   < 0.1%
Other values (51500)    55225   99.7%

Minimum 10 values

Value                 Count   Frequency (%)
3.579413597 × 10^13       1   < 0.1%
1.52474579 × 10^14        1   < 0.1%
1.664652655 × 10^14       1   < 0.1%
2.037409535 × 10^14       1   < 0.1%
2.215385223 × 10^14       1   < 0.1%
3.033655489 × 10^14       1   < 0.1%
3.488329061 × 10^14       1   < 0.1%
4.353240613 × 10^14       1   < 0.1%
4.923040577 × 10^14       1   < 0.1%
5.050029954 × 10^14       1   < 0.1%

Maximum 10 values

Value                 Count   Frequency (%)
9.999357271 × 10^18       1   < 0.1%
9.998996003 × 10^18       1   < 0.1%
9.998663691 × 10^18       1   < 0.1%
9.998628427 × 10^18       1   < 0.1%
9.998597322 × 10^18       1   < 0.1%
9.998297178 × 10^18       1   < 0.1%
9.998185327 × 10^18       1   < 0.1%
9.998113232 × 10^18       1   < 0.1%
9.997950947 × 10^18       1   < 0.1%
9.997801738 × 10^18       1   < 0.1%
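
fullVisitorId is an identifier, yet it is profiled here as a real number with values near 10^19. At that magnitude a 64-bit float can no longer represent every integer, so distinct identifiers may collapse once the column is held as floating point. The sketch below illustrates the effect with a hypothetical identifier of the same magnitude as the reported maximum; the value is invented for the example and does not come from the dataset.

```python
# Hypothetical identifier of the same magnitude as the reported Maximum
# (about 9.999 x 10^18); it is not a real value from the dataset.
visitor_id = "9999357271000000001"

as_float = float(visitor_id)     # how a float64 column would hold it
round_trip = str(int(as_float))  # what comes back out

# The low-order digits do not survive the float round trip.
print(visitor_id == round_trip)

# float64 represents integers exactly only up to 2 ** 53.
print(int(visitor_id) > 2 ** 53)
```

Keeping such a column as text when loading the data is one way to avoid the problem; the summary statistics above are not meaningful for an identifier in any case.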

totals.hits
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 211
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.152904426
Minimum: 1
Maximum: 500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 2
Q3: 7
95-th percentile: 39
Maximum: 500
Range: 499
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 17.75991486
Coefficient of variation (CV): 2.178354355
Kurtosis: 97.11865055
Mean: 8.152904426
Median Absolute Deviation (MAD): 1
Skewness: 6.898052433
Sum: 451516
Variance: 315.4145758
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
1                    24516   44.3%
2                     8220   14.8%
3                     3532    6.4%
4                     2252    4.1%
5                     1679    3.0%
6                     1312    2.4%
7                     1081    2.0%
8                      887    1.6%
9                      779    1.4%
10                     658    1.2%
Other values (201)   10465   18.9%

Minimum 10 values

Value   Count   Frequency (%)
1       24516   44.3%
2        8220   14.8%
3        3532    6.4%
4        2252    4.1%
5        1679    3.0%
6        1312    2.4%
7        1081    2.0%
8         887    1.6%
9         779    1.4%
10        658    1.2%

Maximum 10 values

Value   Count   Frequency (%)
500         3   < 0.1%
471         1   < 0.1%
406         1   < 0.1%
387         1   < 0.1%
385         1   < 0.1%
382         1   < 0.1%
378         1   < 0.1%
361         1   < 0.1%
347         1   < 0.1%
331         1   < 0.1%
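
Several of the totals.hits statistics above are derived from others: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the range and IQR are differences of quantiles. The sketch below recomputes them from the reported figures as a consistency check; it assumes these standard definitions, which the report does not spell out.

```python
# Values copied from the totals.hits statistics above.
mean = 8.152904426
std = 17.75991486
q1, q3 = 1, 7
minimum, maximum = 1, 500

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV       = {cv:.4f}")
print(f"Variance = {variance:.2f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
```

A CV above 2 together with a median of 2 and a maximum of 500 points to a heavily right-skewed distribution, in line with the reported skewness.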

totals.pageviews
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 156
Distinct (%): 0.3%
Missing: 8
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.548353891
Minimum: 1
Maximum: 469
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 2
Q3: 6
95-th percentile: 30
Maximum: 469
Range: 468
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 13.01594648
Coefficient of variation (CV): 1.987666931
Kurtosis: 114.3956568
Mean: 6.548353891
Median Absolute Deviation (MAD): 1
Skewness: 6.921747415
Sum: 362602
Variance: 169.4148628
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
1                    24850   44.9%
2                     8607   15.5%
3                     3732    6.7%
4                     2374    4.3%
5                     1828    3.3%
6                     1298    2.3%
7                     1118    2.0%
8                      888    1.6%
9                      794    1.4%
10                     696    1.3%
Other values (146)    9188   16.6%

Minimum 10 values

Value   Count   Frequency (%)
1       24850   44.9%
2        8607   15.5%
3        3732    6.7%
4        2374    4.3%
5        1828    3.3%
6        1298    2.3%
7        1118    2.0%
8         888    1.6%
9         794    1.4%
10        696    1.3%

Maximum 10 values

Value   Count   Frequency (%)
469         1   < 0.1%
431         1   < 0.1%
351         1   < 0.1%
341         1   < 0.1%
323         1   < 0.1%
309         1   < 0.1%
305         1   < 0.1%
270         1   < 0.1%
233         1   < 0.1%
232         1   < 0.1%

totals.timeOnSite
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct: 2694
Distinct (%): 8.8%
Missing: 24783
Missing (%): 44.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 392.2494281
Minimum: 1
Maximum: 15047
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6
Q1: 44
median: 114
Q3: 459
95-th percentile: 1706.15
Maximum: 15047
Range: 15046
Interquartile range (IQR): 415

Descriptive statistics

Standard deviation: 673.3060416
Coefficient of variation (CV): 1.716525235
Kurtosis: 38.08969347
Mean: 392.2494281
Median Absolute Deviation (MAD): 99
Skewness: 4.336266284
Sum: 12002048
Variance: 453341.0256
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count   Frequency (%)
51                      465    0.8%
5                       440    0.8%
4                       436    0.8%
50                      382    0.7%
6                       381    0.7%
52                      380    0.7%
7                       290    0.5%
9                       276    0.5%
54                      274    0.5%
53                      272    0.5%
Other values (2684)   27002   48.8%
(Missing)             24783   44.8%

Minimum 10 values

Value   Count   Frequency (%)
1          35   0.1%
2          62   0.1%
3         200   0.4%
4         436   0.8%
5         440   0.8%
6         381   0.7%
7         290   0.5%
8         252   0.5%
9         276   0.5%
10        213   0.4%

Maximum 10 values

Value   Count   Frequency (%)
15047       1   < 0.1%
14279       1   < 0.1%
12466       1   < 0.1%
11094       1   < 0.1%
10076       1   < 0.1%
10046       1   < 0.1%
9275        1   < 0.1%
8999        1   < 0.1%
8811        1   < 0.1%
8805        1   < 0.1%

totals.transactions
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct: 7
Distinct (%): 0.1%
Missing: 49516
Missing (%): 89.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.047740835
Minimum: 1
Maximum: 8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 1
Q3: 1
95-th percentile: 1
Maximum: 8
Range: 7
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.3253271431
Coefficient of variation (CV): 0.310503449
Kurtosis: 158.2772429
Mean: 1.047740835
Median Absolute Deviation (MAD): 0
Skewness: 10.96153301
Sum: 6145
Variance: 0.10583775
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
Common values

Value       Count   Frequency (%)
1            5671   10.2%
2             155    0.3%
3              19   < 0.1%
5               9   < 0.1%
6               5   < 0.1%
4               4   < 0.1%
8               2   < 0.1%
(Missing)   49516   89.4%

Minimum 10 values

Value   Count   Frequency (%)
1        5671   10.2%
2         155    0.3%
3          19   < 0.1%
4           4   < 0.1%
5           9   < 0.1%
6           5   < 0.1%
8           2   < 0.1%

Maximum 10 values

Value   Count   Frequency (%)
8           2   < 0.1%
6           5   < 0.1%
5           9   < 0.1%
4           4   < 0.1%
3          19   < 0.1%
2         155    0.3%
1        5671   10.2%

totals.newVisits
Categorical

CONSTANT
HIGH CORRELATION
MISSING
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 13705
Missing (%): 24.7%
Memory size: 432.8 KiB
1.0: 41676

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 125028
Distinct characters: 3
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1.0
2nd row: 1.0
3rd row: 1.0
4th row: 1.0
5th row: 1.0

Common Values

Value       Count   Frequency (%)
1.0         41676   75.3%
(Missing)   13705   24.7%

Length

Histogram of lengths of the category

Category Frequency Plot

Value   Count   Frequency (%)
1.0     41676   100.0%

Most occurring characters

ValueCountFrequency (%)
141676
33.3%
.41676
33.3%
041676
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number83352
66.7%
Other Punctuation41676
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
141676
50.0%
041676
50.0%
Other Punctuation
ValueCountFrequency (%)
.41676
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common125028
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
141676
33.3%
.41676
33.3%
041676
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII125028
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
141676
33.3%
.41676
33.3%
041676
33.3%

totals.totalTransactionRevenue
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct: 3688
Distinct (%): 62.9%
Missing: 49520
Missing (%): 89.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 146301015.2
Minimum: 1200000
Maximum: 1.603275 × 10^10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 432.8 KiB

Quantile statistics

Minimum: 1200000
5-th percentile: 15590000
Q1: 30960000
median: 58460000
Q3: 121740000
95-th percentile: 557000000
Maximum: 1.603275 × 10^10
Range: 1.603155 × 10^10
Interquartile range (IQR): 90780000

Descriptive statistics

Standard deviation: 386609149.2
Coefficient of variation (CV): 2.642559579
Kurtosis: 586.1038572
Mean: 146301015.2
Median Absolute Deviation (MAD): 33490000
Skewness: 18.28000083
Sum: 8.5747025 × 10^11
Variance: 1.494666342 × 10^17
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count   Frequency (%)
24990000                 56    0.1%
23990000                 53    0.1%
19990000                 43    0.1%
25990000                 42    0.1%
21990000                 38    0.1%
22990000                 36    0.1%
20990000                 33    0.1%
18990000                 33    0.1%
17990000                 28    0.1%
26990000                 26   < 0.1%
Other values (3678)    5473    9.9%
(Missing)             49520   89.4%

Minimum 10 values

Value     Count   Frequency (%)
1200000       1   < 0.1%
2040000       1   < 0.1%
2490000       1   < 0.1%
2990000       3   < 0.1%
3010000       1   < 0.1%
3160000       1   < 0.1%
3200000       2   < 0.1%
3400000       1   < 0.1%
3500000       1   < 0.1%
3530000       1   < 0.1%

Maximum 10 values

Value              Count   Frequency (%)
1.603275 × 10^10       1   < 0.1%
9227740000             1   < 0.1%
7003500000             1   < 0.1%
6239580000             1   < 0.1%
5945580000             1   < 0.1%
4849600000             1   < 0.1%
4087500000             1   < 0.1%
4013080000             1   < 0.1%
3394800000             1   < 0.1%
3013640000             1   < 0.1%
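
The revenue figures above are large integers whose most common values end in 990000, which is consistent with amounts stored in micro-units (the Google Analytics BigQuery export documents revenue as the value multiplied by 10^6). The report itself does not state the unit, so the conversion below is a sketch under that assumption; the helper name `micros_to_units` is illustrative.

```python
MICROS_PER_UNIT = 1_000_000  # assumption: revenue is stored as value x 10^6

def micros_to_units(micros):
    """Convert a micro-unit revenue figure to currency units (None stays None)."""
    if micros is None:  # None stands in for a session without a transaction
        return None
    return micros / MICROS_PER_UNIT

# Figures copied from the statistics above.
examples = [
    ("most common value", 24990000),
    ("median", 58460000),
    ("minimum", 1200000),
    ("no transaction", None),
]
for label, micros in examples:
    print(label, micros_to_units(micros))
```

Under this reading the most common transaction is 24.99 and the median is 58.46 currency units.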

geoNetwork.country
Categorical

HIGH CARDINALITY

Distinct: 173
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 432.8 KiB
United States: 23746
India: 2526
Vietnam: 2219
United Kingdom: 1742
Turkey: 1730
Other values (168): 23418

Length

Max length: 22
Median length: 20
Mean length: 9.837417165
Min length: 4

Characters and Unicode

Total characters: 544806
Distinct characters: 60
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 15
Unique (%): < 0.1%

Sample

1st row: Belarus
2nd row: Greece
3rd row: Ireland
4th row: Indonesia
5th row: Australia

Common Values

Value                Count   Frequency (%)
United States        23746   42.9%
India                 2526    4.6%
Vietnam               2219    4.0%
United Kingdom        1742    3.1%
Turkey                1730    3.1%
Thailand              1671    3.0%
Brazil                1412    2.5%
Canada                1406    2.5%
Japan                 1007    1.8%
Mexico                 950    1.7%
Other values (163)   16972   30.6%
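
With 173 distinct countries and most of them rare, a common way to tame the cardinality is to fold infrequent categories into a single bucket. The sketch below applies that idea to the counts from the Common Values table; the 2% threshold and the helper name `lump_rare` are arbitrary choices for illustration, not recommendations taken from the report.

```python
from collections import Counter

def lump_rare(counts, total, min_share):
    """Fold categories holding less than `min_share` of `total` rows into 'Other'."""
    lumped = Counter()
    for category, n in counts.items():
        key = category if n / total >= min_share else "Other"
        lumped[key] += n
    return lumped

TOTAL_ROWS = 55381  # number of observations in the report

# Counts copied from the Common Values table above.
top10 = {
    "United States": 23746, "India": 2526, "Vietnam": 2219,
    "United Kingdom": 1742, "Turkey": 1730, "Thailand": 1671,
    "Brazil": 1412, "Canada": 1406, "Japan": 1007, "Mexico": 950,
}

lumped = lump_rare(top10, TOTAL_ROWS, min_share=0.02)
# The table is sorted by count, so each of the 163 remaining countries has
# at most 950 rows (below 2%) and belongs in the same bucket.
lumped["Other"] += 16972

for country, n in lumped.most_common():
    print(f"{country:<15}{n:>6}  {n / TOTAL_ROWS:.1%}")
```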

Length

Histogram of lengths of the category

Value                Count   Frequency (%)
united               25701   31.0%
states               23746   28.6%
india                 2526    3.0%
vietnam               2219    2.7%
kingdom               1742    2.1%
turkey                1730    2.1%
thailand              1671    2.0%
brazil                1412    1.7%
canada                1406    1.7%
japan                 1007    1.2%
Other values (189)   19735   23.8%

Most occurring characters

ValueCountFrequency (%)
t79540
14.6%
e65176
12.0%
a58756
10.8%
n47905
8.8%
i47461
8.7%
d36536
 
6.7%
s29552
 
5.4%
27514
 
5.1%
S26126
 
4.8%
U26099
 
4.8%
Other values (50)100141
18.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter433940
79.7%
Uppercase Letter82878
 
15.2%
Space Separator27514
 
5.1%
Open Punctuation185
 
< 0.1%
Close Punctuation185
 
< 0.1%
Other Punctuation79
 
< 0.1%
Final Punctuation19
 
< 0.1%
Dash Punctuation6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t79540
18.3%
e65176
15.0%
a58756
13.5%
n47905
11.0%
i47461
10.9%
d36536
8.4%
s29552
 
6.8%
r11727
 
2.7%
l9246
 
2.1%
o8330
 
1.9%
Other values (19)39711
9.2%
Uppercase Letter
ValueCountFrequency (%)
S26126
31.5%
U26099
31.5%
I4423
 
5.3%
T4155
 
5.0%
K2484
 
3.0%
V2384
 
2.9%
C2328
 
2.8%
B2074
 
2.5%
P2059
 
2.5%
A1841
 
2.2%
Other values (14)8905
 
10.7%
Other Punctuation
ValueCountFrequency (%)
&74
93.7%
.5
 
6.3%
Space Separator
ValueCountFrequency (%)
27514
100.0%
Open Punctuation
ValueCountFrequency (%)
(185
100.0%
Close Punctuation
ValueCountFrequency (%)
)185
100.0%
Final Punctuation
ValueCountFrequency (%)
19
100.0%
Dash Punctuation
ValueCountFrequency (%)
-6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin516818
94.9%
Common27988
 
5.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
t79540
15.4%
e65176
12.6%
a58756
11.4%
n47905
9.3%
i47461
9.2%
d36536
7.1%
s29552
 
5.7%
S26126
 
5.1%
U26099
 
5.0%
r11727
 
2.3%
Other values (43)87940
17.0%
Common
ValueCountFrequency (%)
27514
98.3%
(185
 
0.7%
)185
 
0.7%
&74
 
0.3%
19
 
0.1%
-6
 
< 0.1%
.5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII544754
> 99.9%
None33
 
< 0.1%
Punctuation19
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t79540
14.6%
e65176
12.0%
a58756
10.8%
n47905
8.8%
i47461
8.7%
d36536
 
6.7%
s29552
 
5.4%
27514
 
5.1%
S26126
 
4.8%
U26099
 
4.8%
Other values (46)100089
18.4%
None
ValueCountFrequency (%)
ô19
57.6%
é12
36.4%
ç2
 
6.1%
Punctuation
ValueCountFrequency (%)
19
100.0%

trafficSource.source
Categorical

HIGH CARDINALITY

Distinct: 108
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 432.8 KiB
(direct): 27422
youtube.com: 17170
google: 7047
Partners: 782
analytics.google.com: 575
Other values (103): 2385

Length

Max length: 31
Median length: 8
Mean length: 8.914790271
Min length: 3

Characters and Unicode

Total characters: 493710
Distinct characters: 35
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 0.1%

Sample

1st row: google
2nd row: google
3rd row: google
4th row: google
5th row: google

Common Values

Value                     Count   Frequency (%)
(direct)                  27422   49.5%
youtube.com               17170   31.0%
google                     7047   12.7%
Partners                    782    1.4%
analytics.google.com        575    1.0%
dfa                         405    0.7%
sites.google.com            236    0.4%
baidu                       235    0.4%
google.com                  199    0.4%
siliconvalley.about.com     151    0.3%
Other values (98)          1159    2.1%

Length

Histogram of lengths of the category

Value                     Count   Frequency (%)
direct                    27422   49.5%
youtube.com               17170   31.0%
google                     7047   12.7%
partners                    782    1.4%
analytics.google.com        575    1.0%
dfa                         405    0.7%
sites.google.com            236    0.4%
baidu                       235    0.4%
google.com                  199    0.4%
siliconvalley.about.com     151    0.3%
Other values (98)          1159    2.1%

Most occurring characters

ValueCountFrequency (%)
e54651
11.1%
o54467
11.0%
c47746
9.7%
t46725
9.5%
u34985
 
7.1%
r29421
 
6.0%
i29311
 
5.9%
d28419
 
5.8%
(27422
 
5.6%
)27422
 
5.6%
Other values (25)113141
22.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter417274
84.5%
Open Punctuation27422
 
5.6%
Close Punctuation27422
 
5.6%
Other Punctuation20794
 
4.2%
Uppercase Letter782
 
0.2%
Decimal Number13
 
< 0.1%
Dash Punctuation3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e54651
13.1%
o54467
13.1%
c47746
11.4%
t46725
11.2%
u34985
8.4%
r29421
7.1%
i29311
7.0%
d28419
6.8%
m19465
 
4.7%
y17992
 
4.3%
Other values (16)54092
13.0%
Decimal Number
ValueCountFrequency (%)
05
38.5%
25
38.5%
52
 
15.4%
31
 
7.7%
Open Punctuation
ValueCountFrequency (%)
(27422
100.0%
Close Punctuation
ValueCountFrequency (%)
)27422
100.0%
Other Punctuation
ValueCountFrequency (%)
.20794
100.0%
Uppercase Letter
ValueCountFrequency (%)
P782
100.0%
Dash Punctuation
ValueCountFrequency (%)
-3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin418056
84.7%
Common75654
 
15.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e54651
13.1%
o54467
13.0%
c47746
11.4%
t46725
11.2%
u34985
8.4%
r29421
7.0%
i29311
7.0%
d28419
6.8%
m19465
 
4.7%
y17992
 
4.3%
Other values (17)54874
13.1%
Common
ValueCountFrequency (%)
(27422
36.2%
)27422
36.2%
.20794
27.5%
05
 
< 0.1%
25
 
< 0.1%
-3
 
< 0.1%
52
 
< 0.1%
31
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII493710
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e54651
11.1%
o54467
11.0%
c47746
9.7%
t46725
9.5%
u34985
 
7.1%
r29421
 
6.0%
i29311
 
5.9%
d28419
 
5.8%
(27422
 
5.6%
)27422
 
5.6%
Other values (25)113141
22.9%

trafficSource.medium
Categorical

HIGH CORRELATION

Distinct: 6
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 432.8 KiB
(none): 27422
referral: 19330
organic: 6980
affiliate: 782
cpc: 462

Length

Max length: 9
Median length: 8
Mean length: 6.819504884
Min length: 3

Characters and Unicode

Total characters: 377671
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: organic
2nd row: organic
3rd row: organic
4th row: organic
5th row: organic

Common Values

Value       Count   Frequency (%)
(none)      27422   49.5%
referral    19330   34.9%
organic      6980   12.6%
affiliate     782    1.4%
cpc           462    0.8%
cpm           405    0.7%

Length

Histogram of lengths of the category

Category Frequency Plot

Value       Count   Frequency (%)
none        27422   49.5%
referral    19330   34.9%
organic      6980   12.6%
affiliate     782    1.4%
cpc           462    0.8%
cpm           405    0.7%

Most occurring characters

ValueCountFrequency (%)
e66864
17.7%
r64970
17.2%
n61824
16.4%
o34402
9.1%
a27874
7.4%
(27422
7.3%
)27422
7.3%
f20894
 
5.5%
l20112
 
5.3%
i8544
 
2.3%
Other values (5)17343
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter322827
85.5%
Open Punctuation27422
 
7.3%
Close Punctuation27422
 
7.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e66864
20.7%
r64970
20.1%
n61824
19.2%
o34402
10.7%
a27874
8.6%
f20894
 
6.5%
l20112
 
6.2%
i8544
 
2.6%
c8309
 
2.6%
g6980
 
2.2%
Other values (3)2054
 
0.6%
Open Punctuation
ValueCountFrequency (%)
(27422
100.0%
Close Punctuation
ValueCountFrequency (%)
)27422
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin322827
85.5%
Common54844
 
14.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e66864
20.7%
r64970
20.1%
n61824
19.2%
o34402
10.7%
a27874
8.6%
f20894
 
6.5%
l20112
 
6.2%
i8544
 
2.6%
c8309
 
2.6%
g6980
 
2.2%
Other values (3)2054
 
0.6%
Common
ValueCountFrequency (%)
(27422
50.0%
)27422
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII377671
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e66864
17.7%
r64970
17.2%
n61824
16.4%
o34402
9.1%
a27874
7.4%
(27422
7.3%
)27422
7.3%
f20894
 
5.5%
l20112
 
5.3%
i8544
 
2.3%
Other values (5)17343
 
4.6%

trafficSource.isTrueDirect
Boolean

CONSTANT
HIGH CORRELATION
MISSING
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 38404
Missing (%): 69.3%
Memory size: 432.8 KiB
True: 16977
(Missing): 38404

Value       Count   Frequency (%)
True        16977   30.7%
(Missing)   38404   69.3%
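
The column is flagged as constant only because its sole recorded value is True and every other cell is missing. If missing is taken to mean "not a true direct visit" (an assumption; the report does not say how the absence should be read), the column becomes an ordinary two-valued flag. The same reading could apply to totals.bounces and totals.newVisits. A minimal sketch with toy sessions:

```python
def flag_to_bool(value):
    """Map a presence-only flag to a boolean: recorded -> True, missing -> False."""
    return value is not None  # None stands in for a missing cell

# Toy sessions (not taken from the dataset).
sessions = [
    {"trafficSource.isTrueDirect": True, "totals.bounces": None},
    {"trafficSource.isTrueDirect": None, "totals.bounces": 1.0},
]
flags = ["trafficSource.isTrueDirect", "totals.bounces"]
cleaned = [{col: flag_to_bool(row[col]) for col in flags} for row in sessions]
print(cleaned)

# Share of True implied by the counts reported above.
true_share = 16977 / (16977 + 38404)
print(f"{true_share:.1%}")
```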

device.browser
Categorical

HIGH CORRELATION

Distinct: 27
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 432.8 KiB
Chrome: 38027
Safari: 12182
Firefox: 2028
Internet Explorer: 1055
Edge: 557
Other values (22): 1532

Length

Max length: 24
Median length: 6
Mean length: 6.353189722
Min length: 4

Characters and Unicode

Total characters: 351846
Distinct characters: 53
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6
Unique (%): < 0.1%

Sample

1st row: Chrome
2nd row: Chrome
3rd row: Chrome
4th row: Chrome
5th row: Chrome

Common Values

Value               Count   Frequency (%)
Chrome              38027   68.7%
Safari              12182   22.0%
Firefox              2028    3.7%
Internet Explorer    1055    1.9%
Edge                  557    1.0%
Opera                 360    0.7%
Safari (in-app)       308    0.6%
Opera Mini            277    0.5%
Android Webview       193    0.3%
YaBrowser             116    0.2%
Other values (17)     278    0.5%

Length

Histogram of lengths of the category

Value               Count   Frequency (%)
chrome              38027   66.2%
safari              12490   21.7%
firefox              2028    3.5%
internet             1055    1.8%
explorer             1055    1.8%
opera                 637    1.1%
edge                  557    1.0%
in-app                308    0.5%
mini                  277    0.5%
android               223    0.4%
Other values (23)     803    1.4%

Most occurring characters

ValueCountFrequency (%)
r57124
16.2%
e45124
12.8%
o41761
11.9%
C38216
10.9%
m38079
10.8%
h38045
10.8%
a26145
7.4%
i15886
 
4.5%
f14526
 
4.1%
S12525
 
3.6%
Other values (43)24415
6.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter291364
82.8%
Uppercase Letter57467
 
16.3%
Space Separator2079
 
0.6%
Open Punctuation308
 
0.1%
Dash Punctuation308
 
0.1%
Close Punctuation308
 
0.1%
Decimal Number9
 
< 0.1%
Connector Punctuation3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r57124
19.6%
e45124
15.5%
o41761
14.3%
m38079
13.1%
h38045
13.1%
a26145
9.0%
i15886
 
5.5%
f14526
 
5.0%
x3100
 
1.1%
n3016
 
1.0%
Other values (14)8558
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
C38216
66.5%
S12525
 
21.8%
F2029
 
3.5%
E1623
 
2.8%
I1056
 
1.8%
O648
 
1.1%
M335
 
0.6%
B288
 
0.5%
A274
 
0.5%
W193
 
0.3%
Other values (9)280
 
0.5%
Decimal Number
ValueCountFrequency (%)
04
44.4%
12
22.2%
51
 
11.1%
41
 
11.1%
21
 
11.1%
Space Separator
ValueCountFrequency (%)
2079
100.0%
Open Punctuation
ValueCountFrequency (%)
(308
100.0%
Dash Punctuation
ValueCountFrequency (%)
-308
100.0%
Close Punctuation
ValueCountFrequency (%)
)308
100.0%
Connector Punctuation
ValueCountFrequency (%)
_3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin348831
99.1%
Common3015
 
0.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
r57124
16.4%
e45124
12.9%
o41761
12.0%
C38216
11.0%
m38079
10.9%
h38045
10.9%
a26145
7.5%
i15886
 
4.6%
f14526
 
4.2%
S12525
 
3.6%
Other values (33)21400
 
6.1%
Common
ValueCountFrequency (%)
2079
69.0%
(308
 
10.2%
-308
 
10.2%
)308
 
10.2%
04
 
0.1%
_3
 
0.1%
12
 
0.1%
51
 
< 0.1%
41
 
< 0.1%
21
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII351846
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r57124
16.2%
e45124
12.8%
o41761
11.9%
C38216
10.9%
m38079
10.8%
h38045
10.8%
a26145
7.4%
i15886
 
4.5%
f14526
 
4.1%
S12525
 
3.6%
Other values (43)24415
6.9%

device.operatingSystem
Categorical

HIGH CORRELATION

Distinct13
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size432.8 KiB
Windows
21174 
Macintosh
19138 
Android
5717 
iOS
5049 
Linux
2219 
Other values (8)
 
2084

Length

Max length13
Median length12
Mean length7.328560337
Min length3

Characters and Unicode

Total characters     405863
Distinct characters  36
Distinct categories  5
Distinct scripts     2
Distinct blocks      1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
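
As a rough illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module over the five sample values listed for this variable. The function name `category_counts` and the `CATEGORY_NAMES` mapping from two-letter codes to readable names are assumptions made for this example; they are not taken from the tool that produced the report.

```python
import unicodedata
from collections import Counter

# Readable names for a few general-category codes (illustrative subset).
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The five sample rows shown for device.operatingSystem.
sample = ["Windows", "Android", "Windows", "Android", "iOS"]
counts = category_counts(sample)
print(counts)
```

Run over the full column instead of the five samples, the same tally would be expected to yield the per-category totals that the report lists.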

Unique

Unique      0
Unique (%)  0.0%

Sample

1st rowWindows
2nd rowAndroid
3rd rowWindows
4th rowAndroid
5th rowiOS

Common Values

Value             Count  Frequency (%)
Windows           21174  38.2%
Macintosh         19138  34.6%
Android           5717   10.3%
iOS               5049   9.1%
Linux             2219   4.0%
Chrome OS         1733   3.1%
(not set)         215    0.4%
Windows Phone     92     0.2%
Samsung           12     < 0.1%
BlackBerry        12     < 0.1%
Other values (3)  20     < 0.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
windows21266
37.0%
macintosh19138
33.3%
android5717
 
10.0%
ios5049
 
8.8%
linux2219
 
3.9%
os1740
 
3.0%
chrome1733
 
3.0%
not215
 
0.4%
set215
 
0.4%
phone92
 
0.2%
Other values (6)55
 
0.1%

Most occurring characters

ValueCountFrequency (%)
i53429
13.2%
n48681
12.0%
o48181
11.9%
s40631
10.0%
d32711
8.1%
W21277
 
5.2%
w21266
 
5.2%
h20963
 
5.2%
t19579
 
4.8%
a19162
 
4.7%
Other values (26)79983
19.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter339565
83.7%
Uppercase Letter63810
 
15.7%
Space Separator2058
 
0.5%
Open Punctuation215
 
0.1%
Close Punctuation215
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i53429
15.7%
n48681
14.3%
o48181
14.2%
s40631
12.0%
d32711
9.6%
w21266
 
6.3%
h20963
 
6.2%
t19579
 
5.8%
a19162
 
5.6%
c19150
 
5.6%
Other values (11)15812
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
W21277
33.3%
M19138
30.0%
S6801
 
10.7%
O6789
 
10.6%
A5717
 
9.0%
L2219
 
3.5%
C1733
 
2.7%
P92
 
0.1%
B24
 
< 0.1%
N11
 
< 0.1%
Other values (2)9
 
< 0.1%
Space Separator
ValueCountFrequency (%)
2058
100.0%
Open Punctuation
ValueCountFrequency (%)
(215
100.0%
Close Punctuation
ValueCountFrequency (%)
)215
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin403375
99.4%
Common2488
 
0.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
i53429
13.2%
n48681
12.1%
o48181
11.9%
s40631
10.1%
d32711
8.1%
W21277
 
5.3%
w21266
 
5.3%
h20963
 
5.2%
t19579
 
4.9%
a19162
 
4.8%
Other values (23)77495
19.2%
Common
ValueCountFrequency (%)
2058
82.7%
(215
 
8.6%
)215
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII405863
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i53429
13.2%
n48681
12.0%
o48181
11.9%
s40631
10.0%
d32711
8.1%
W21277
 
5.2%
w21266
 
5.2%
h20963
 
5.2%
t19579
 
4.8%
a19162
 
4.7%
Other values (26)79983
19.7%

device.deviceCategory
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size432.8 KiB
desktop
44180 
mobile
9658 
tablet
 
1543

Length

Max length7
Median length7
Mean length6.79774652
Min length6

Characters and Unicode

Total characters     376466
Distinct characters  12
Distinct categories  1
Distinct scripts     1
Distinct blocks      1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
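
The character totals for this variable can be cross-checked from its three value counts alone. The sketch below is a minimal illustration, assuming the counts reported above for desktop, mobile and tablet; the helper name `character_counts` is hypothetical.

```python
from collections import Counter

# Value counts as reported for device.deviceCategory.
value_counts = {"desktop": 44180, "mobile": 9658, "tablet": 1543}

def character_counts(value_counts):
    """Weight each character of a value by the number of rows holding it."""
    chars = Counter()
    for value, n in value_counts.items():
        for ch in value:
            chars[ch] += n
    return chars

chars = character_counts(value_counts)
total = sum(chars.values())

print(total)                 # total characters across the column
print(len(chars))            # number of distinct characters
print(chars.most_common(3))  # the three most frequent characters
```

The result agrees with the figures in the report: 376466 total characters, 12 distinct characters, and `e`, `o`, `t` as the most frequent characters with 55381, 53838 and 47266 occurrences.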

Unique

Unique      0
Unique (%)  0.0%

Sample

1st rowdesktop
2nd rowmobile
3rd rowdesktop
4th rowmobile
5th rowmobile

Common Values

Value    Count  Frequency (%)
desktop  44180  79.8%
mobile   9658   17.4%
tablet   1543   2.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
desktop44180
79.8%
mobile9658
 
17.4%
tablet1543
 
2.8%

Most occurring characters

ValueCountFrequency (%)
e55381
14.7%
o53838
14.3%
t47266
12.6%
d44180
11.7%
s44180
11.7%
k44180
11.7%
p44180
11.7%
b11201
 
3.0%
l11201
 
3.0%
m9658
 
2.6%
Other values (2)11201
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter376466
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e55381
14.7%
o53838
14.3%
t47266
12.6%
d44180
11.7%
s44180
11.7%
k44180
11.7%
p44180
11.7%
b11201
 
3.0%
l11201
 
3.0%
m9658
 
2.6%
Other values (2)11201
 
3.0%

Most occurring scripts

ValueCountFrequency (%)
Latin376466
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e55381
14.7%
o53838
14.3%
t47266
12.6%
d44180
11.7%
s44180
11.7%
k44180
11.7%
p44180
11.7%
b11201
 
3.0%
l11201
 
3.0%
m9658
 
2.6%
Other values (2)11201
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII376466
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e55381
14.7%
o53838
14.3%
t47266
12.6%
d44180
11.7%
s44180
11.7%
k44180
11.7%
p44180
11.7%
b11201
 
3.0%
l11201
 
3.0%
m9658
 
2.6%
Other values (2)11201
 
3.0%

hits.type
Unsupported

REJECTED
UNSUPPORTED

Missing0
Missing (%)0.0%
Memory size432.8 KiB

hits.hour
Unsupported

REJECTED
UNSUPPORTED

Missing0
Missing (%)0.0%
Memory size432.8 KiB

hits.minute
Unsupported

REJECTED
UNSUPPORTED

Missing0
Missing (%)0.0%
Memory size432.8 KiB

totals.bounces
Categorical

CONSTANT
HIGH CORRELATION
MISSING
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing30667
Missing (%)55.4%
Memory size432.8 KiB
1.0
24714 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters     74142
Distinct characters  3
Distinct categories  2
Distinct scripts     1
Distinct blocks      1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique      0
Unique (%)  0.0%

Sample

1st row1.0
2nd row1.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

Value      Count  Frequency (%)
1.0        24714  44.6%
(Missing)  30667  55.4%

Length

Histogram of lengths of the category

Category Frequency Plot

Value  Count  Frequency (%)
1.0    24714  100.0%

Most occurring characters

Value  Count  Frequency (%)
1      24714  33.3%
.      24714  33.3%
0      24714  33.3%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     49428  66.7%
Other Punctuation  24714  33.3%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
1      24714  50.0%
0      24714  50.0%
Other Punctuation
Value  Count  Frequency (%)
.      24714  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  74142  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
1      24714  33.3%
.      24714  33.3%
0      24714  33.3%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  74142  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
1      24714  33.3%
.      24714  33.3%
0      24714  33.3%

hits.eCommerceAction.action_type
Real number (ℝ≥0)

ZEROS

Distinct      7
Distinct (%)  < 0.1%
Missing       0
Missing (%)   0.0%
Infinite      0
Infinite (%)  0.0%
Mean          0.01410230946
Minimum       0
Maximum       6
Zeros         55010
Zeros (%)     99.3%
Negative      0
Negative (%)  0.0%
Memory size   432.8 KiB

Quantile statistics

Minimum                    0
5-th percentile            0
Q1                         0
median                     0
Q3                         0
95-th percentile           0
Maximum                    6
Range                      6
Interquartile range (IQR)  0

Descriptive statistics

Standard deviation               0.1966671608
Coefficient of variation (CV)    13.9457414
Kurtosis                         326.7933552
Mean                             0.01410230946
Median Absolute Deviation (MAD)  0
Skewness                         16.95036344
Sum                              781
Variance                         0.03867797213
Monotonicity                     Not monotonic
Histogram with fixed size bins (bins=7)
Value  Count  Frequency (%)
0      55010  99.3%
1      148    0.3%
2      103    0.2%
3      73     0.1%
4      29     0.1%
5      16     < 0.1%
6      2      < 0.1%
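
The descriptive statistics reported for this variable can be reproduced from its value counts. The sketch below is only an illustration: it reads the fused rows as value and count pairs (for example, value 1 with count 148), and it assumes the sample variance with an n - 1 denominator, a choice that happens to match the reported figure.

```python
from math import sqrt

# Value -> count, read from the table for hits.eCommerceAction.action_type.
value_counts = {0: 55010, 1: 148, 2: 103, 3: 73, 4: 29, 5: 16, 6: 2}

n = sum(value_counts.values())                       # number of observations
total = sum(v * c for v, c in value_counts.items())  # the "Sum" statistic
mean = total / n

# Sample variance (divides by n - 1); an assumption about the estimator.
variance = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / (n - 1)
std = sqrt(variance)

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean
zeros_pct = 100 * value_counts[0] / n

print(n, total, mean, variance, std, cv, zeros_pct)
```

Under that reading the counts add up to the 55381 observations and the sum of 781 stated in the report, which supports the split of the fused rows used here.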

Interactions

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
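
The calculation described above can be sketched in plain Python: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. The helper names and the choice of average ranks for ties are assumptions made for this illustration, and the input data are made up.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their deviations."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship (hypothetical data).
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

On this input ρ is 1 up to floating-point rounding, while Pearson's r stays below 1, which is the sense in which ρ catches nonlinear monotonic relationships.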

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
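
A minimal sketch of this formula follows, together with a check of the invariance under changes of location and scale. The function name `pearson_r` and the sample values are hypothetical; the common 1/n factors in the covariance and the standard deviations cancel, so they are omitted.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their deviations."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical per-session values, for illustration only.
hits = [1, 2, 4, 7, 12]
pageviews = [1, 2, 3, 6, 10]

r = pearson_r(hits, pageviews)
# Shifting and rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([3 * h + 5 for h in hits], pageviews)
print(r, r_rescaled)
```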

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
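
The pair-counting definition above corresponds to the variant usually called tau-a, sketched below with made-up data. Library implementations often apply a tie correction (tau-b) instead, so the report's own figures may differ from this sketch when ties are present; the function name is hypothetical.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, ignoring ties."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair oppositely
    n = len(x)
    return (concordant - discordant) / (n * (n - 1) / 2)

# One swapped pair out of six: 5 concordant, 1 discordant.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```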

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
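
One common way to write the bias-corrected coefficient is sketched below for a contingency table given as a list of rows. This is an illustration of the correction as it is usually stated, not necessarily the exact code path of the reporting tool; the function name and the example tables are hypothetical, and the sketch assumes no row or column of the table is empty.

```python
from math import sqrt

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for an r x k table of counts."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Subtract the expected bias and shrink the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

perfect = cramers_v_corrected([[50, 0], [0, 50]])        # perfect association
independent = cramers_v_corrected([[10, 10], [10, 10]])  # independence
print(perfect, independent)
```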

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently across categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
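
Nullity correlation can be illustrated as the Pearson correlation between presence indicators of two columns. The sketch below is an assumption about how such a heatmap value may be computed, using small made-up columns loosely modelled on the sample rows, where `totals.bounces` tends to be present when `totals.transactions` is missing; the helper names are hypothetical.

```python
from math import isnan, sqrt

def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and isnan(value))

def nullity_correlation(col_a, col_b):
    """Pearson correlation of the 0/1 presence indicators of two columns."""
    a = [0.0 if is_missing(v) else 1.0 for v in col_a]
    b = [0.0 if is_missing(v) else 1.0 for v in col_b]
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    sa = sqrt(sum((p - ma) ** 2 for p in a))
    sb = sqrt(sum((q - mb) ** 2 for q in b))
    if sa == 0 or sb == 0:
        # A fully present or fully missing column has no nullity variation.
        return float("nan")
    return cov / (sa * sb)

nan = float("nan")
bounces = [1.0, 1.0, 1.0, nan, nan, nan]       # hypothetical values
transactions = [nan, nan, nan, 1.0, 1.0, nan]  # hypothetical values
corr = nullity_correlation(bounces, transactions)
print(corr)
```

A value near -1 means one column tends to be filled exactly where the other is empty, a value near +1 means they are missing together, and columns without any missing values are left undefined here.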

Sample

First rows

(index) | socialEngagementType | channelGrouping | date | fullVisitorId | totals.hits | totals.pageviews | totals.timeOnSite | totals.transactions | totals.newVisits | totals.totalTransactionRevenue | geoNetwork.country | trafficSource.source | trafficSource.medium | trafficSource.isTrueDirect | device.browser | device.operatingSystem | device.deviceCategory | hits.type | hits.hour | hits.minute | totals.bounces | hits.eCommerceAction.action_type
0 | Not Socially Engaged | Organic Search | 20170101 | 8160804435292640144 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Belarus | google | organic | NaN | Chrome | Windows | desktop | [PAGE] | [5] | [46] | 1.0 | 0
1 | Not Socially Engaged | Organic Search | 20170101 | 2767232911658101391 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Greece | google | organic | NaN | Chrome | Android | mobile | [PAGE] | [12] | [25] | 1.0 | 0
2 | Not Socially Engaged | Organic Search | 20170101 | 5462105261493871939 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Ireland | google | organic | NaN | Chrome | Windows | desktop | [PAGE] | [23] | [22] | 1.0 | 0
3 | Not Socially Engaged | Organic Search | 20170101 | 95191702071842980 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Indonesia | google | organic | NaN | Chrome | Android | mobile | [PAGE] | [5] | [21] | 1.0 | 0
4 | Not Socially Engaged | Organic Search | 20170101 | 6692505544910101563 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Australia | google | organic | NaN | Chrome | iOS | mobile | [PAGE] | [19] | [28] | 1.0 | 0
5 | Not Socially Engaged | Organic Search | 20170101 | 2907946815222203879 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Malaysia | google | organic | NaN | Chrome | Macintosh | desktop | [PAGE] | [0] | [33] | 1.0 | 0
6 | Not Socially Engaged | Direct | 20170101 | 5652181055723415564 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Netherlands | (direct) | (none) | True | Chrome | Linux | desktop | [PAGE] | [21] | [29] | 1.0 | 0
7 | Not Socially Engaged | Direct | 20170101 | 7817519219547424500 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | United States | (direct) | (none) | True | Chrome | Android | mobile | [PAGE] | [5] | [17] | 1.0 | 0
8 | Not Socially Engaged | Organic Search | 20170101 | 8362452234388944224 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Netherlands | google | organic | NaN | Safari | iOS | tablet | [PAGE] | [11] | [21] | 1.0 | 0
9 | Not Socially Engaged | Paid Search | 20170101 | 1309141223204608712 | 1 | 1.0 | NaN | NaN | 1.0 | NaN | Saudi Arabia | google | cpc | NaN | Safari | iOS | mobile | [PAGE] | [6] | [43] | 1.0 | 0

Last rows

socialEngagementTypechannelGroupingdatefullVisitorIdtotals.hitstotals.pageviewstotals.timeOnSitetotals.transactionstotals.newVisitstotals.totalTransactionRevenuegeoNetwork.countrytrafficSource.sourcetrafficSource.mediumtrafficSource.isTrueDirectdevice.browserdevice.operatingSystemdevice.deviceCategoryhits.typehits.hourhits.minutetotals.bounceshits.eCommerceAction.action_type
55371Not Socially EngagedReferral2016123184615608606015811761412.0356.01.0NaN37180000.0United States(direct)(none)TrueChromeMacintoshdesktop[PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15][1, 2, 2, 2, 2, 2, 2, 2, 3, 6, 6, 7, 7, 7]NaN0
55372Not Socially EngagedOrganic Search2016123113236729904152096131817.0618.01.01.014990000.0United States(direct)(none)NaNChromeMacintoshdesktop[PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8][14, 14, 14, 15, 16, 17, 17, 17, 18, 18, 18, 18, 18, 21, 22, 24, 24, 24]NaN0
55373Not Socially EngagedOrganic Search2016123182240290322209193381815.0386.01.01.047980000.0United States(direct)(none)NaNSafariiOStablet[PAGE, EVENT, PAGE, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9][52, 52, 52, 52, 52, 52, 53, 53, 53, 53, 54, 54, 54, 55, 55, 57, 58, 58]NaN0
55374Not Socially EngagedReferral2016123118758348558395673952018.0478.01.0NaN97030000.0United States(direct)(none)TrueChromeMacintoshdesktop[PAGE, PAGE, PAGE, PAGE, EVENT, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14][41, 42, 43, 44, 44, 44, 44, 44, 45, 45, 46, 46, 46, 46, 47, 47, 48, 49, 49, 49]NaN0
55375Not Socially EngagedOrganic Search2016123174001533340640566832217.0523.01.0NaN66980000.0United States(direct)(none)TrueChromeChrome OSdesktop[PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12][14, 14, 14, 14, 14, 15, 17, 18, 19, 19, 19, 19, 20, 20, 20, 20, 20, 20, 21, 22, 23, 23]NaN0
55376Not Socially EngagedDirect2016123133913712076058385513427.0747.01.01.020590000.0United States(direct)(none)TrueChromeWindowsdesktop[PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, EVENT, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19][21, 21, 21, 22, 22, 22, 23, 23, 23, 23, 23, 23, 23, 23, 24, 24, 24, 24, 25, 25, 26, 26, 26, 26, 27, 27, 27, 27, 31, 31, 31, 33, 33, 33]NaN0
55377Not Socially EngagedReferral201612315940320291403246393632.0937.01.0NaN36570000.0United States(direct)(none)TrueChromeMacintoshdesktop[PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, EVENT, PAGE, PAGE, PAGE, EVENT, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7][41, 41, 43, 43, 44, 44, 45, 45, 45, 46, 46, 46, 46, 46, 46, 46, 46, 46, 46, 47, 47, 48, 48, 50, 50, 51, 52, 52, 52, 53, 54, 55, 55, 55, 56, 56]NaN0
55378Not Socially EngagedReferral2016123176167892038077823113936.01349.01.01.087980000.0United States(direct)(none)NaNChromeMacintoshdesktop[PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22, 22][11, 11, 12, 12, 13, 13, 14, 14, 18, 18, 18, 19, 19, 19, 19, 19, 19, 20, 20, 20, 21, 21, 24, 24, 24, 25, 25, 25, 26, 27, 28, 28, 29, 29, 30, 31, 33, 34, 34]NaN0
55379Not Socially EngagedOrganic Search2016123128142535797694921093930.01462.01.01.064990000.0United States(direct)(none)NaNChromeMacintoshdesktop[PAGE, PAGE, EVENT, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10][19, 19, 19, 19, 19, 20, 20, 20, 20, 20, 20, 20, 20, 20, 23, 24, 24, 24, 24, 27, 28, 29, 29, 29, 30, 30, 30, 32, 32, 33, 33, 36, 37, 37, 37, 38, 42, 43, 43]NaN0
55380Not Socially EngagedReferral2016123169760576827935766467250.01377.01.0NaN279880000.0United States(direct)(none)TrueChromeMacintoshdesktop[PAGE, PAGE, PAGE, PAGE, EVENT, EVENT, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, EVENT, PAGE, EVENT, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, EVENT, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE, PAGE][19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19][23, 24, 24, 27, 27, 27, 27, 28, 28, 28, 28, 28, 28, 28, 29, 29, 29, 29, 29, 30, 30, 30, 30, 30, 30, 30, 31, 31, 31, 31, 32, 32, 32, 32, 33, 33, 33, 33, 35, 35, 35, 35, 35, 36, 36, 36, 36, 37, 37, 37, 37, 37, 37, 37, 37, 37, 38, 38, 40, 40, 41, 41, 41, 41, 42, 43, 44, 45, 45, 46, 46, 46]NaN0